Exploring infield hedgerow loss using CV to read historic OS maps

Machine Learning
Maps
Author

William Ward

Published

February 7, 2024

I am putting together a new approach to managing the restoration of hedgerows in actively cultivated fields using a computer vision model that identifies change from early 20th century Ordinate Survey maps (second edition, six-inch map series). What makes it ‘field-level’ is the use of a relatively new dataset produced by researchers working with the UK Center for Ecology & Hydrology that is produced to map information of the crop production for individual fields (source to the data). I have essentially cookie-cuttered out chunks of the old maps using the polygons from this dataset to act as individual images into a convolution neural network (CNN). I have managed to condense the entire process into a single-document pipeline that goes through every step of:

  1. Querying the Library of Scotland’s database of historic OS maps (using MapReader Python package)
  2. Downloading these maps onto the computer’s hard drive (Using MapReader)
  3. Georeferencing them based on corner coordinates (using GDAL library)
  4. Joining maps into one large raster(GDAL)
  5. Not in pipeline yet: splitting the raster by the polygons of the Crop Plus data.
  6. Load in individual patches produced in raster split into MapReader’s annotation tool.
  7. Train ResNet-18 (pre-trained on OS maps by Houssini et al., 2023) CNN

Here is a visualisation of this pipeline:

Results:

Here are the county-level results after bad predictions were omitted and the Yorkshire study site was removed (predictions were entirely incorrect):

I then did some analysis on the difference between the size of fields that had and had not experienced hedgerow loss, and he difference between arable vs pastoral fields. The results (below) were as expected, with the largest fields experiencing hedgerow loss because of their dilation by removing boundaries, whiile arable farmland experienced greater instances of hedgerow loss. The latter was expected because arable land is more prone to expansion for bigger fields that can be managed more easily by bigger machinery. These findings were less about making testing these prior assumptions, and more about seeing if the model’s predictions aligned with them for verification of success.

After these results were obtained, I broadcasted the resulting predictions onto the polygon features to visualise the results in ArcGIS Pro. I will publish this as an interactive map in due course.

Appendix (table of finer details of the county-level results):